Identification of activated cryptic 5′ splice sites using structure profiles and odds measure

Authors

  • Kun-Nan Tsai
  • Daryi Wang
Abstract

The activation of cryptic 5' splice sites (5' SSs) is often associated with human hereditary diseases. DNA-based mutation screening strategies are commonly used to recognize cryptic 5' SSs, because features of the local DNA sequence can influence which cryptic 5' SS is chosen. To improve the identification of cryptic 5' SSs, we developed a structure-based method named SPO (structure profiles and odds measure). It combines two parameters, a structural feature derived from the hydroxyl radical cleavage pattern and an odds measure, to assess the likelihood that a cryptic 5' SS is activated in competition with its paired authentic 5' SS. The SPO algorithm achieves higher prediction accuracy than current tools for identifying activated cryptic 5' SSs, including MaxEnt, MDD, the Markov model, the weight matrix model, the Shapiro and Senapathy matrix, R(i) and ΔG. In addition, the ΔSPO scores predicted by the SPO algorithm correlated more strongly with the strength of cryptic 5' SS activation than the scores from the other seven methods. In conclusion, the SPO algorithm provides optimal identification of cryptic 5' SSs, can be applied to the design of mutagenesis experiments for various splicing events, and may help in investigating the relationship between structural variants and human hereditary diseases.
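
The abstract does not define the odds measure or the structure profile, so the following is only a rough sketch of the general idea of scoring a cryptic site against its paired authentic site. It uses a generic position-wise log-odds model, closer in spirit to a weight matrix model than to SPO itself, and it leaves out the hydroxyl radical cleavage feature entirely. The function names, the toy training sequences, the pseudocount and the uniform background are all assumptions made for illustration, not values from the paper.

```python
import math
from collections import Counter

BASES = "ACGT"

def build_log_odds(sites, pseudocount=1.0, background=0.25):
    """Per-position log2 odds of each base versus a uniform background.

    `sites` is a list of equal-length aligned splice-site sequences.
    The pseudocount and background are illustrative assumptions.
    """
    length = len(sites[0])
    total = len(sites) + pseudocount * len(BASES)
    table = []
    for pos in range(length):
        counts = Counter(seq[pos] for seq in sites)
        table.append({
            base: math.log2(((counts[base] + pseudocount) / total) / background)
            for base in BASES
        })
    return table

def odds_score(seq, table):
    """Sum of the per-position log-odds for one candidate site."""
    return sum(table[i][base] for i, base in enumerate(seq))

def delta_score(cryptic, authentic, table):
    """Score of the cryptic site minus the score of its authentic partner."""
    return odds_score(cryptic, table) - odds_score(authentic, table)

# Hypothetical 9-nt sequences used only to make the sketch runnable.
TOY_SITES = ["CAGGTAAGT", "AAGGTGAGT", "CAGGTAAGA", "GAGGTAAGT"]
table = build_log_odds(TOY_SITES)

authentic = "CAGGTAAGT"
cryptic = "CAGGTTGGC"
print(delta_score(cryptic, authentic, table))
```

In this toy setting a negative difference means the cryptic site resembles the training sites less than the authentic site does. How SPO actually combines its two parameters into ΔSPO is described in the full paper, not here.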

Similar resources

Cryptic intron activation within the large exon of the mouse polymeric immunoglobulin receptor gene: cryptic splice sites correspond to protein domain boundaries.

The fourth exon of the mouse polymeric immunoglobulin receptor (pIgR) is 654 nt long and, despite being surrounded by large introns, is constitutively spliced into the mRNA. Deletion of an 84 nt sequence from this exon strongly activated both cryptic 5' and 3' splice sites surrounding a 78 nt cryptic intron. The 84 nt deletion is just upstream of the cryptic 3' splice site; the cryptic 3' spli...

Aberrant 3′ splice sites in human disease genes: mutation pattern, nucleotide structure and comparison of computational tools that predict their utilization

The frequency distribution of mutation-induced aberrant 3' splice sites (3'ss) in exons and introns is more complex than for 5' splice sites, largely owing to sequence constraints upstream of intron/exon boundaries. As a result, predicting their location remains a challenging task. Here, nucleotide sequences of 218 previously reported aberrant 3'ss activated by disease-causing mutations ...

Mechanism for cryptic splice site activation during pre-mRNA splicing.

The 5' splice site of a pre-mRNA is recognized by U1 small nuclear ribonucleoprotein particles (snRNP) through base pairing with the 5' end of U1 small nuclear RNA (snRNA). Single-base substitutions within a 9-nucleotide 5'-splice-site sequence can abolish or attenuate use of that site and, in higher eukaryotes, can also activate nearby "cryptic" 5' splice sites. Here we show that the effects o...

Aberrant 5′ splice sites in human disease genes: mutation pattern, nucleotide structure and comparison of computational tools that predict their utilization

Despite a growing number of splicing mutations found in hereditary diseases, the utilization of aberrant splice sites and their effects on gene expression remain challenging to predict. We compiled sequences of 346 aberrant 5' splice sites (5'ss) that were activated by mutations in 166 human disease genes. Mutations within the 5'ss consensus accounted for 254 cryptic 5'ss and mutations elsewhere act...

DBASS3 and DBASS5: databases of aberrant 3′- and 5′-splice sites

DBASS3 and DBASS5 provide comprehensive repositories of new exon boundaries that were induced by pathogenic mutations in human disease genes. Aberrant 5'- and 3'-splice sites were activated either by mutations in the consensus sequences of natural exon-intron junctions (cryptic sites) or elsewhere ('de novo' sites). DBASS3 and DBASS5 currently contain approximately 900 records of cryptic and de...


Journal title:

Volume 40, issue

Pages -

Publication date: 2012